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ABSTRACT 

The spliceosome undergoes major changes in protein and RNA composition during pre-mRNA splicing. Knowing the proteins — 
and their respective quantities — at each spliceosomal assembly stage is critical for understanding the molecular mechanisms and 
regulation of splicing. Here, we applied three independent mass spectrometry (MS)-based approaches for quantification of these 
proteins: (1) metabolic labeling by SILAC, (2) chemical labeling by iTRAQ, and (3) label-free spectral count for quantification of 
the protein composition of the human spliceosomal precatalytic B and catalytic C complexes. In total we were able to quantify 157 
proteins by at least two of the three approaches. Our quantification shows that only a very small subset of spliceosomal proteins 
(the U5 and U2 Sm proteins, a subset of U5 snRNP-specific proteins, and the U2 snRNP-specific proteins U2A' and U2B") remains 
unaltered upon transition from the B to the C complex. The MS-based quantification approaches classify the majority of proteins as 
dynamically associated specifically with the B or the C complex. In terms of experimental procedure and the methodical aspect of 
this work, we show that metabolically labeled spliceosomes are functionally active in terms of their assembly and splicing kinetics 
and can be utilized for quantitative studies. Moreover, we obtain consistent quantification results from all three methods, 
including the relatively straightforward and inexpensive label-free spectral count technique. 

Keywords: spliceosome; quantitative proteomics; stable isotope labeling with amino acids in cell culture (SILAC); isobaric tags for 
relative and absolute quantification (iTRAQ); spectral count 



INTRODUCTION 

Proteomic analysis provides essential information about the 
function and regulation of protein complexes. Numerous 
mass spectrometry (MS) techniques have been developed 
to determine protein composition and their quantities 
(Aebersold and Mann 2003; Steen and Mann 2004; Ong 
and Mann 2005; Domon and Aebersold 2006; Yates et al. 
2009; Bantscheff et al. 2012; Nikolov et al. 2012). To date, 
the relative quantification technique of stable-isotope label- 
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ing followed by MS has frequently been used to quantify pro- 
teins from tissues, from different functional states of a cell, 
from compartments or from ptill-down experiments involv- 
ing protein(-ligand) complexes. However, they have rarely 
been used to compare directly isolated native protein com- 
plexes that represent different functional states of a single 
molecular machine. Here, we examine the proteomes of iso- 
lated human precatalytic B and catalytically active C spliceo- 
somal complexes by quantitative MS. 

The spliceosome catalyzes eukaryotic pre-mRNA splicing 
to generate mature mRNAs and is a highly dynamic and com- 
plex macromolecular machine. Spliceosomes consist of one 
or more of the five uridine-rich small nuclear ribonucleopro- 
tein particles (snRNPs; Ul, U2, U5, the U4/U6 di-snRNP, and 
the U4/U6.U5 tri-snRNP), each of which contains U snRNA 
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(s), a core set of seven Sm or LSm proteins, and numerous U 
snRNP-specific proteins. In vitro, several different functional 
states of the spliceosome can be distinguished; these are 
termed the E, A, B, B""', and C complexes (Wahl et al. 
2009). Transitions between these different states are often ac- 
companied by dramatic rearrangements in the spliceosomal 
interactions. For instance, the precatalytic B complex, which 
contains the U4/U6.U5 tri-snRNP, undergoes structural rear- 
rangements that destabilize the Ul and U4 snRNPs to form 
the activated B'"^' complex; this then catalyzes the first step of 
splicing to generate the catalytically active C complex, which 
in turn catalyzes the second step of splicing to produce the ma- 
ture mRNA (Wahl et al. 2009). Determining the protein com- 
position of each of the spliceosomal states has been a major 
goal in the past decade. Initial studies using active spliceo- 
somes assembled in vitro revealed that in addition to the 
U snRNP-specific proteins, more than 100 proteins are in- 
volved in the splicing cycle (Jurica et al. 2002; Rappsilber 
et al. 2002; Zhou et al. 2002; Jurica and Moore 2003) . More de- 
tailed studies focused on analyzing the protein composition of 
the isolated functional states of the A, B, B"^', and C complexes 
and the mRNPs in humans (Deckert et al. 2006; Behzadnia 
et al. 2007; Bessonov et al. 2008, 2010), yeast (B, B"^*, and C 
complexes) (Fabrizio et al. 2009), and Drosophila (B and C 
complexes) (Herold et al. 2009). MS analyses have revealed 
that the human B and C complexes contain approximately 
130 and 150 proteins, respectively. Of these, 105 proteins are 
stably associated with both complexes (Bessonov et al. 
2008). The substantial change in composition from one func- 
tional spliceosomal state to another has been clearly shown 
by the determination of the protein compositions of isolated 
A, B, B"^', and C complexes, each of which differs remarkably 
from the others even when isolated under native conditions 
that allow close comparison (Deckert et al. 2006; Behzadnia 
et al. 2007; Bessonov et al. 2008, 2010; Fabrizio et al. 2009; 
Herold et al. 2009) . By using a novel 2D gel electrophoresis sys- 
tem to quantify spliceosomal complexes, it has recently been 
shown that only 60-70 protein factors are moderately or high- 
ly abundant in the various spliceosomal complexes (Agafonov 
et al. 201 1). In these previous studies, the protein components 
of isolated spliceosomes were analyzed in a so-called "semi- 
quantitative" manner by determining the number of se- 
quenced peptides by MS (peptide count) or by measuring 
the intensity of stained proteins after 2D gel electrophoresis 
to determine the relative and absolute abundances, respec- 
tively, of the proteins (Deckert et al. 2006; Behzadnia et al. 
2007; Merz et al. 2007; Bessonov et al. 2008, 2010; Fabrizio 
et al. 2009; Agafonov et al. 2011). We applied now here — 
for the first time — three independent MS-based quantifica- 
tion methods (SILAC metabolic labeling [Ong et al. 2002], 
iTRAQ chemical labeling [Ross et al. 2004], and the label- 
free spectral count [Liu et al. 2004]) in determining the pro- 
tein quantities in purified spliceosomal B and C complexes. 

We identified and quantified approximately 200 proteins 
in both the B and the C complex preparations; this represents 



>95% of the previously published proteomes of the B and C 
complexes (Deckert et al. 2006; Bessonov et al. 2008). 
Strikingly, only a few of the proteins identified are "core" 
components of both complexes, the quantities of which do 
not change upon transition from B-to-C complex. These 
were a subset of U5 snRNP proteins, the U2-specific proteins 
U2A' and U2B", and the evolutionarUy conserved Sm pro- 
teins of U5 and U2. Most proteins, in contrast, either joined 
or left the spliceosome during its transition from the precata- 
lytic B complex to the catalytically active C complex. This 
study expands our previous investigation of the proteomes 
of spliceosomal B and C complexes and may further be ex- 
tended to study the assembly kinetics of spliceosomal com- 
plexes in nuclear extract (NE). 

RESULTS 

For our quantitative MS studies, we used the same experimen- 
tal approach as Bessonov et al. (2008), namely, separation of 
the protein components of glycerol-gradient-purified spli- 
ceosomal complexes by ID gel electrophoresis, and thus re- 
duced the sample complexity to a similar extent (Bessonov 
et al. 2008). Importantly, as a prerequisite of such an MS- 
based quantitative analysis, namely, comparing the exact 
amount of natively purified assemblies, we benefit from the 
measurement of the amount of ^^P-labeled pre-mRNA that 
is bound to the respective complexes. All quantitative MS- 
based analyses were performed with two biological replicates. 
We resumed the protein assignment previously introduced by 
Deckert et al. (2006), Bessonov et al. (2008), and Agafonov 
et al. (2011) and adjusted the protein classification of B and 
C complexes according to our quantification results (Table 
1). In addition, we list some proteins that could not unambig- 
uously be assigned to either of the two complexes, owing to 
their fluctuating quantification results (Supplemental Table 
1). Finally, we identified and quantified several proteins that 
clearly show association with either the B or the C complex 
by all three approaches and were not classified as B- or C-spe- 
cific proteins previously by Bessonov etal. (2008) or Agafonov 
etal. (2011); Table 1. 

MS analysis of metabolically stable isotope- labeled 
(SILAC) spliceosomes 

Metabolic labeling with stable isotopes (Ong et al. 2002) is 
considered to be the gold standard in protein quantification; 
as proteins are fully labeled, samples can be pooled at an early 
stage of sample preparation and cleavage of proteins with 
endoproteinase trypsin always leads to coeluting labeled pep- 
tide pairs that are analyzed in the MS. We prepared spliceo- 
somes from HeLa NEs labeled with Lys+6, and Arg+10 
("heavy" NE) and spliceosomes prepared from NEs contain- 
ing the nonlabeled amino acids Lys+0 and Arg+0 ("light" 
NE) (Fig. lA). We had previously verified that the different 
SILAC NE preparations did not display significant differences 
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44/4 


1.18 


7.47 


127/17 


14.45 


B 


RBM42 


50.3 


gi|21359951 


12.35 


2/— 






— /— 




2.00 


2/1 




B 


THRAP3 *^ 


108.6 


gi|4827040 


10.21 


5/11 


9.39 


1.62 


30/4 


0.60 


3.58 


68/19 


0.17 


B 


UBL5 


8.5 


gi|13236510 




— /— 




13.56 


1/2 


4.92 


OSB 


12/— 




B 


HsKin17 


45.2 


gi 13124883 


1.33 


7/5 


0.10 


3.04 


6/8 


0.08 


21.00 


21/1 




B 


Npw38BP 


70.0 


gi|7706501 


3.53 


2/2 


0.60 


8.04 


6/17 


1.04 


24.00 


24/1 




B 


Npw38 


30.5 


gi 174735456 


3.55 


—/I 




7.68 


— /3 




OSB 


6/— 




B 


FUSE3 


61.7 


gi|1 0081 6392 


6.08 


6/3 


0.21 


3.48 


4/17 


0.43 


3.00 


10/1 




B 


PUF60 


54.0 


gi 109087698 


6.45 


7/3 


0.32 


3.30 


1/19 


1.36 


OSB 


9/— 




B 


RBM5/LUCA15 


56.12 


gi|62087206 


12.27 


7/— 




5.86 


2/2 


1.23 


OSB 


6/5 




B 


SAFB-like 


115.4 


gi 62244004 


8.21 


2/1 


3.81 


4.10 


5/— 




12.00 


12/1 




B 


SFRS12^^^ 


59.4 


gi 28703790 


10.08 


—/I 




4.60 


1/6 


2.55 










SPF30 (HB 


26.7 


gi|5032113 


10.08 


— /5 




30.16 


1/— 




OSB 


5/— 






Step 2 factors 


























hPRP22 *^ 


139.3 


gi|4826690 


0.12 


49/48 


0.03 


0.21 


55/11 


0.11 


0.05 


13/257 


0.01 


C 


hPRP18 •'^ 


39.9 


gi 4506123 


0.10 


—/I 




0.39 


1/4 


0.01 


OSC 


— /5 




C 


hPRPI 7 


65.5 


gi 7706657 


0.26 


28/29 


0.04 


0.32 


32/57 


0.19 


0.34 


31/91 


0.12 


c 


hPRP16 


140.5 


gi|1 7999539 


0.48 


— /2 




0.43 


—/1 2 




0.17 


1/6 




c 


hSLU7 *^ 


68.4 


gi 27477111 


0.32 


6/14 


0.11 


0.15 


19/38 


0.07 


OSC 


— /81 




c 


C complex proteins 


























Abstrakt *^ 


69.8 


gi|21071032 


0.16 


25/24 


0.05 


0.12 


48/90 


0.06 


0.02 


3/1 64 


0.01 


c 


CCIP p29 *^ 


28.7 


gi 46371998 


0.09 


11/18 


0.09 


0.18 


11/7 


0.00 


0.04 


2/45 




c 


DDX35 


78.9 


gi 20544129 


0.06 


13/15 


0.01 


0.25 


28/22 


0.11 


0.11 


7/64 


0.05 


c 
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MVV 
















Spectral 






Complex 


Protein 




Accession no. 


Ctt AC 


U 

if 


jtuev 


iTRAQ 


# 


StDev 


count 


# 

ff 


jtuev 


assignment 


Q9BRR8 


103.3 


gi|74732921 


0.05 


— /7 




0.29 


3/9 


0.15 


0.04 


1/23 




C 


c19orf29 


88.6 


gi|126723149 


0.05 


— /21 




0.17 


21/35 


0.00 


0.01 


1/76 




C 


(NY-REN-24) 


























PPIase-like 3b 


18.6 


gi|19557636 


0.02 


2/7 


0.01 


0.24 


8/3 


0.10 


OSC 


— /21 




c 


PPWD1 *^ 


73.6 


gi 124308049 


0.08 


18/18 


0.00 


0.15 


39/53 


0.07 


0.01 


1/81 




c 


MORGl 


34.3 


gi|153791298 


0.06 


2/5 


0.02 


0.39 


1/3 


0.42 


0.07 


1/14 




c 


FRGl 


29.2 


gi|4758404 


0.22 


3/2 


0.00 


0.58 


— /6 




OSC 


—/II 




c 


NOSIP*'^ 


33.2 


gi 7705716 


0.10 


—11 




0.15 


9/3 


0.05 


OSC 


— /21 




C 


CPKOW *'^''"' ^ 


52.1 


gi|15811782 


0.21 


SIM 


0.1 7 


0.66 


6/— 




0.27 


4/15 


0.21 


c 


C1orf55 


39.3 


gi|148664216 


0.12 


Ills 


0.10 


0.10 


19/14 


0.03 


OSC 


— /41 




c 


FAM32A 


13.1 


gi|7661696 


0.07 


1/4 


0.07 


0.11 


11— 




0.09 


1/11 




c 


Tip-49 


50.2 


gi 4506753 


0.13 


— /— 






—1— 




0.25 


111 




c 


PPIC 


88.5 


gi 142560244 


0.37 


2/5 


0.24 


0.20 


bl— 




0.12 


2/17 




c 


FAM50A 


40.1 


gi 14758220 


0.09 


1/7 


0.09 


0.22 


15/5 


0.10 


0.03 


1/36 




c 


FAM50B 


38.6 


gi 6912326 


0.05 


—11 




0.25 


2/3 


0.16 


OSC 


—/1 9 




c 


C9orf78 *^ 


33.7 


gi 7706557 


0.03 


111 


0.01 


0.20 


6/2 


0.09 


OSC 


—/1 6 




c 


C10orf4 


37.5 


gi 124432067 


0.10 


—IS 




0.10 


2/6 


0.01 


0.09 


1/11 




c 


CXorf56 *^ 


25.6 


gi|11545813 


0.13 


3/8 


0.08 


0.16 


12/11 


0.11 


0.02 


1/51 




c 


DCCR14 


52.4 


gi|13027630 


0.14 


8/8 


0.16 


0.19 


3/8 


0.14 


OSC 


— /24 




c 


CCDC130 


44.7 


gi|13540614 


0.10 


—11 






—1— 




OSC 


—17 




c 


NKAP 


47.0 


gi|13375676 


0.06 


1/5 


0.05 


0.22 


bi- 




OSC 


—/1 4 




c 


ZCCHC10 


18.4 


gi|8923106 


0.12 


111 


0.06 


0.66 


ll- 




OSC 


—/I 7 




c 


CDK10 


35.4 


gi|16950647 


0.10 


—lb 




0.22 


3/4 


0.03 


OSC 


—19 




c 


TTC14 


88.2 


gi|33457330 


0.21 


1/8 


0.19 


0.27 


8/11 


0.07 


OSC 


—/1 9 




c 


NFKBIL1 


43.1 


gi|26787991 


0.08 


—1— 






—1— 




0.20 


1/5 




c 


NY-CO-10 ^ 


53.8 


gi|64276486 


0.51 


8/8 


0.26 


0.66 


Mil 


0.03 


0.67 


10/15 


1.90 


c 


K1AA1604 ^ 


105.5 


gi|55749769 


0.19 


17/26 


0.04 


0.40 


IS in 


0.08 


0.27 


28/1 03 


0.18 


c 


DDX34 


128.1 


gi|381 58022 


0.10 


—IS 




0.37 


7/20 


0.12 


OSC 


— /24 




c 


NUFIPl 


56.4 


gi|6912542 


0.04 


1/1 


0.02 


0.28 


1/2 


0.14 


0.42 


2/5 


0.12 


c 


PRKRIP1 


21.0 


gi|13375901 


0.12 


4/6 


0.07 


0.12 


7/4 


0.02 


OSC 


—/1 3 




A c 


EJC/mRNP 


























elF4A3 ^ 


46.9 


gi|7661920 


0.24 


18/21 


0.03 


0.15 


31/— 




0.32 


33/1 04 


0.10 


c 


Magoh 


17.2 


gi 4505087 


0.24 


6/3 


0.04 


0.18 


10/5 


0.04 


0.20 


5/25 


0.08 


c 


Y14 *c 


19.9 


gi 4826972 


0.15 


111 


0.03 


0.25 


7/2 


0.20 


0.46 


6/13 


0.39 


c 


Pinin 


81.6 


gi|33356174 


0.77 


1/4 


0.65 


2.00 


5/3 


0.15 


4.25 


17/4 


0.35 


B 


UAP56 


49.1 


gi 18375623 


3.18 


5/4 


4.10 


3.28 


11/3 


1.79 


1.53 


23/15 


0.82 


B 


SR-related proteins 


























SRmieO 


102.5 


gi|42542379 


0.32 


3/3 


0.25 


1.12 


SI— 




1.50 


3/2 


0.71 




SRm300 


300.0 


gi|4759098 


0.33 


13/26 


0.10 


0.97 


7/4 


0.93 


0.12 


10/85 


0.32 




SR proteins 


























SF2/ASF 


27.8 


gi|5902076 


3.52 


44/23 


2.69 


1.63 


14/8 


1.17 


0.96 


50/52 


0.49 




9G8 ^' ^ 


27.4 


gi 172534660 


2.94 


14/19 


1.46 


1.87 


24/9 


1.00 


1.51 


1 1 6/77 


0.58 




SRp20 


19.4 


gi|4506901 


6.02 


111 


4.42 


2.09 


8/1 


0.62 


1.52 


35/23 


0.77 




SRp30c 


25.5 


gi 4506903 


1.32 


16/18 


0.02 


0.76 


^bl^ 


0.33 


1.17 


42/36 


0.53 




SRp38 ^ 


31.3 


gi 5730079 


0.95 


6/— 




1.18 


15/7 


0.38 


0.67 


41/61 


0.04 




SRp40 


31.3 


gi 3929378 


1.30 


10/10 


0.85 


1.34 


4/5 


0.70 


1.25 


35/28 


0.25 




SRp46 


31.2 


gi|15055543 


4.15 


—11 




1 .85 


11 — 




1.00 


4/4 






SRp55 


39.6 


gi|20127499 


1.49 


4/15 


0.12 


1.11 


17/7 


0.33 


1.18 


47/40 


0.07 




SRp75 


56.8 


gi|21361282 


5.93 


—11 




1.16 


—11 












hTra-2 alpha 


32.7 


gi|9558733 


3.61 


15/6 


1.01 


2.12 


9/2 


0.97 


1.63 


31/19 


0.76 




hTra-2 beta ^ 


33.7 


gi 4759098 


4.75 


2/16 


1.10 


1.30 


22/9 


0.10 


1.10 


45/41 


0.24 




hnRNP 


























hnRNP Al 


38.7 


gi 14504445 


12.52 


—/1 2 


0.56 


4.32 


6/2 


1.25 


11.00 


22/2 


8.49 




hnRNP A3 *^ 


39.6 


gi|34740329 


5.43 


4/3 


0.57 


4.88 


6/1 


0.37 


17.00 


17/1 






hnRNP A2/B1 


37.4 


gi 114043072 


6.10 


1/8 


4.49 


4.11 


6/4 


0.73 


7.00 


21/3 






hnRNP C**^' 


33.3 


gi 14758544 


1.51 


41/28 


0.24 


1.84 


23/16 


0.17 


1.35 


77/57 


0.74 




hnRNP D 


38.4 


gi 114110420 


10.08 


111 


0.46 




—1— 




4.00 


4/1 
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B:C protein ratios 



Protein 


MW 
[kDa] 


Accession no. 


SI LAC 


it 


StDev 


iTRAQ 


# 


StDev 


Spectral 
count 


# 


Complex 
StDev assignment 


nnKINr r 


43./ 


gl 1 1 4f54/U4Ub 


J.OD 


— /z 




T OA 


— / 1 




U./ J 


T /A 




hnRNP C 


47.4 


gi|56699409 


3.52 


19/11 


0.32 


1.87 


32/6 


0.49 


1.29 


49/38 


1.51 


hnRNP HI 


49.1 


gi|5031753 


2.64 


3/6 


0.01 


1.56 


4/13 


0.40 


1.25 


5/4 




hnRNP K 


51.0 


gi|14165435 


12.05 


13/— 






1/2 


0.74 


OSB 


24/— 




hnRNP M 


77.5 


gi 14141152 


3.56 


8/1 


1.04 


3.49 


8/4 


0.04 


3.00 


39/13 


5.36 


hnRNP Q 


69.6 


gi 15809590 


2.49 


3/— 




1.34 


— /— 




1.89 


17/9 


0.24 


hnRNP R 


70.9 


gi|5031755 


1.05 


12/4 


0.25 


0.98 


8/1 


0.15 


1.44 


23/16 


0.88 


hnRNP U *^ 


90.6 


gi|14141161 


11.24 


2/2 


2.24 


3.86 


10/— 




2.00 


8/4 


0.71 


PCBP1 *^ 


37.5 


gi|5453854 


5.42 


11/2 


0.14 


4.84 


5/10 


0.26 








PCBP2 


38.1 


gi|14141166 


4.47 


12/2 


1.37 


3.15 


4/16 


0.80 


2.74 


52/19 


1.54 


RALY 


32.5 


gi|8051631 


1.25 


— /8 




1.64 


4/3 


1.19 


0.56 


10/18 


0.47 



The average B:C protein ratio of two biological replicates after SILAC, iTRAQ, or spectral count quantification is shown. For SILAC and iTRAQ 
quantification, the number of peptide ratios (#) used for quantification is given for both biological replicates (1st replicate/2nd replicate). For 
spectral count, the sum of spectra from both biological replicates for B and C complexes is given (sum spectra B/sum spectra C). For all quan- 
tification methods, the standard deviation between the two biological replicates is provided. Note that if no spectra were acquired in one of 
the complexes, no standard deviation can be calculated for spectral count analysis. The proteins were classified as specific to complex B or to 
complex C, or as part of the spliceosomal core. Proteins that lacked an assignment could not be quantified by the respective approach. 
Proteins quantified by only one approach are not shown. For proteins identified solely in the B or C complex (OSB, only spectra B complex; 
OSC, only spectra C complex), no B:C protein ratio could be calculated by spectral count. The proteins are assigned according to the method 
of Bessonov et al. (2008) and have been regrouped on the basis of our quantification results. Proteins labeled with an asterisk are inferred to 
be major components of the human spliceosome and to be abundant in complexes A, B, C, or B^" according to Agafonov et al. (2011). 
Proteins highlighted in gray were added to the list of spliceosomal B and C complex proteins according to results from this study. Protein ac- 
cession numbers were observed from NCBInr database. 



in their overall protein abundance, as shown by LC-MS/MS 
analyses of peptides obtained from equivalent concentrations 
of the "light" or "heavy" NE proteins hydrolyzed in gel 
(Nikolov et al. 2011). 

In order to ensure that spliceosomal complexes isolated 
from "light" and "heavy" NEs do not exhibit differences in 
their activity, we monitored the splicing kinetics (Fig. 2A) 
and spliceosomal complex formation (Fig. 2B) in "light" 
and "heavy" SILAC NEs, as well as the RNA compositions 
of the purified B and C complexes (Fig. 2C). The analyses re- 
vealed that catalytically active spliceosomes form in exactly 
the same manner from "light" and "heavy" SILAC NEs, 
with no significant quantitative differences in the overall pro- 
tein composition or functional pre-mRNA splicing activities 
(Fig. 2). We then isolated spliceosomal B complexes from 
"light" SILAC NEs and C complexes from "heavy" SILAC 
NEs (Fig. lA), as described elsewhere (Bessonov et al. 2008). 
Proteins from the B and C complex mixtures were then sepa- 
rated by ID SDS-PAGE and analyzed. We identified and 
quantified 266 proteins (see Supplemental Fig. 1); this includ- 
ed all of the previously published spliceosomal proteins from 
the B and C complexes, with the exception of the GIO, UBL5, 
RACKl, TOEl, RBM7, TH0C3, and SC35 proteins (Supple- 
mental Table 2). Additionally, we identified and quantified 91 
proteins that in previous studies had not been found to be 
components of the spliceosome and that in most cases repre- 
sent contaminating proteins commonly detected in large- 
scale proteomic studies (Supplemental Table 3). 



Chemical labeling of spliceosomal proteins 
with iTRAQ reagents 

We also applied chemical labeling with stable isotopes on pu- 
rified spliceosomal B and C complexes using iTRAQ reagents 
(Ross et al. 2004). Chemical labeling is widely accepted as a 
quantitative MS-based method to analyze proteins derived 
from different samples. iTRAQ labeling of peptides from pu- 
rified spliceosomal B and C complexes was performed with a 
method most recently established after gel separation of pro- 
teins as described previously (Schmidt and Urlaub 2009; 
Schmidt et al. 2013). Extracted peptides from the B complex 
were labeled with iTRAQ- 11 5 and from the C complex with 
iTRAQ- 116. Our iTRAQ analysis allowed us to quantify 
almost all previously published spliceosomal proteins from 
B and C complex preparations (Bessonov et al. 2008), ex- 
cept for the proteins Ul-C, RACKl, Tip-49, CCDC130, 
NFKBILl, THOC3, DBPA, RBM42, and SC35 (Supplemen- 
tal Table 4). Similarly to the SILAC results, we found a total of 
265 proteins; among these were 87 proteins that had not been 
listed in the previous analysis by counting peptides; these 
mainly represented commonly encountered contaminants 
(Supplemental Table 5). 

Proteome analysis: spectral count 

Previous analyses of protein components of spliceosomal 
complexes used the numbers of sequenced peptides in each 
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FIGURE 1. Purification of the spliceosomal B and C complexes for 
proteomic analysis and iTRAQ and SILAC quantification. (A) B and 
C complexes were purified from "heavy" or "light" SILAC NEs, respec- 
tively. The complexes were allowed to assemble onto MS2-tagged pre- 
mRNA for 6 or 1 80 min, respectively. The complexes were then isolated 
by gradient centrifugation and affinity purification; isolated complexes 
were pooled in equal amounts; the proteins were separated by gel elec- 
trophoresis; and the peptides generated were analyzed by LC-MS/MS. 
{B) B and C complexes were purified from "normal" (light) NE. The 
proteins were separated by gel electrophoresis, and after in-gel digestion, 
the peptides generated were analyzed by LC-MS/MS for proteomic anal- 
ysis, or the peptides generated from the B complex were labeled with 
iTRAQ reagent 115 and those generated from the C complex were la- 
beled with iTRAQ reagent 116. After pooling, the samples were analyzed 
by LC-MS/MS (iTRAQ quantification). 



sample to estimate the quantities of the proteins in their re- 
spective complexes (Deckert et al. 2006; Behzadnia et al. 
2007; Bessonov et al. 2008, 2010; Fabrizio et al. 2009; 
Herold et al. 2009; Agafonov et al. 2011). As in these studies 
spliceosomal complexes were purified to the highest stan- 
dards in biochemical terms (i.e., kinetic splicing assays, 
density gradient centrifugation to obtain homogenous popu- 
lations), the number of peptides sequenced in the subsequent 
MS analysis was considered to be a valid approach for the es- 
timation of amounts of proteins in various complexes. 
Nonetheless, a certain ambiguity remained in these studies, 
namely, whether the peptide numbers and the respective 
comparison with numbers obtained from other complexes 
meet the requirements of quantitative protein analysis. As 
we have now obtained quantitative values for the proteins pre- 
sent in the spliceosomal B and C complex, we set out to com- 
pare these values with MS data obtained from a previous data 
set (Bessonov et al. 2008). Instead of peptide count, we used 



the spectral count from the same data set of Bessonov et al. 
(2008) with the software Scaffold 2 (Supplemental Table 5). 
The ratios of the spectra or peptides of B-complex proteins 
to C-complex proteins are listed in Supplemental Table 6 
(note that ratios could not be assigned to the proteins that 
were exclusively present in either the B or the C complex; 
these were labeled as "OSB" [only spectra B complex] or 
"OSC" [only spectra C complex]). This comparison revealed 
that assigning a protein abundance on the basis of peptide 
counting gives results consistent with those of spectral count- 
ing, with differences observed for only a few proteins (such as 
RACKl andPinin) (Supplemental Table 6). Moreover, we ob- 
served, when compared with the values obtained from the la- 
beling experiments, good agreement in the quantification of 
the various proteins in the different complexes, so that it 
can be concluded — with a few exceptions that are discussed 
below — that spectral count (or even peptide count) is an ap- 
propriate method to quantify proteins in various spliceoso- 
mal complexes. 

Validation of the quantification results 

The investigated spliceosomal B and C complexes provide an 
ideal system for validation of the obtained quantitative re- 
sults. The expected theoretical B:C protein ratio for the 
cap-binding proteins (which interact with the 5' cap structure 
of the pre-mRNA) is 1:1; for the Sm proteins (which are core 
subunits for all of the U snRNPs except U6), 2:1 (since the C 
complex contains the U2 and U5 but lacks the Ul and U4). 
Indeed, we observe a 1:1 ratio for the CBP20 and CBP80 
cap-binding proteins in the B:C complexes by all three 
methods (SILAC, iTRAQ, and label-free spectral count), 
with the exception of the spectral count for CPB20, which 
gave a ratio of 0.67 (Supplemental Tables 2, 4, 6). For the 
B:C protein ratios for the Sm proteins, both iTRAQ and 
SILAC yielded an average protein ratio of 1.75, which is close 
to the expected value of 2 (Fig. 3A). However, spectral count 
gave the expected value only for the SmF and SmG proteins 
and showed clearly lower ratios for the other Sm proteins 
(Fig. 3A). QveraU, the quantification of these proteins pro- 
vided an internal validation of the different quantification 
methods. 

A stable "core" of U snRNP-specific proteins 

The snRNAs of U2 and the U5 snRNPs remain stably associ- 
ated with the B and C complexes; however, only a subset of 
the U2- and US-specific proteins appears in equal amounts 
within the two complexes (Fig. 3B,C). SILAC, iTRAQ, and 
spectral count showed that the U2-A' and U2-B" proteins 
had B:C ratios of approximately 1, whereas the U2 snRNP-as- 
sociated splicing factors SF3a and SF3b were more abundant 
in the B complex (Table 1; Fig. 3B). Likewise, only four of the 
eight U5 snRNP proteins were found to be present in a 1:1 
ratio between the two complexes (220K, 200K, 116K, and 
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nov et al. 201 1). Of note, to draw conclu- 
sion from the absolute number of the 
values, that means, whether, for example, 
an extremely high or low value (e.g., Ul- 
A, 25.95 [SILAC]; PPIase-like 3b, 0.02 
[SILAC] ) compared with a moderate val- 
ue (e.g., hPrpl9, 0.36 [SILAC]; MFAP, 
2.65 [SILAC]) reflects the complete pres- 
ence or absence in B or C complex, re- 
spectively, cannot be unambiguously 
addressed. 
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FIGURE 2. MetabolicaUy labeled NEs retained full catalytic activity, as shown by analyzing the B 
and C complexes purified from "heavy" or "light" SILAC NEs, respectively. (A) The splicing ki- 
netics were determined from aliquots of splicing reactions taken from 0-180 min and analyzed by 
denaturing gel electrophoresis. Pre-mRNA and splicing products were visualized by autoradiog- 
raphy. Splicing products first appeared after 10 min. (B) The spliceosomal complex formation was 
assayed by native agarose gel electrophoresis and visualized by autoradiography. The A and B 
complexes were first observed after 2 and 4 min, respectively, while the C complex first appeared 
after 10-15 min. (C) The RNA compositions of purified B ("light" SILAC NE) and C ("heavy" 
SILAC NEs) complexes were analyzed by denaturing gel electrophoresis and visualized by silver 
staining (lanes 1,3) or autoradiography (lanes 2,4). B complexes contained Ul, U2, U4, U5, and 
U6 snRNA (lane 1) and a large amount of pre-mRNA (lane 2). C complexes contained U2, U5, 
and U6 snRNA (lane 3) and splicing products and reduced amounts of pre-mRNA (lane 4). 



40K) (Fig. 3C); we therefore consider the proteins U2A', 
U2B", U5-220K, 200K, 116K, and 40K together with the 
Sm proteins of U2 and U5 snRNPs, which are also stably as- 
sociated with both complexes (see above. Fig. 3A), to be the 
spliceosomal "core" proteins in the B and C complexes. 

Specific proteins of the B and C complexes 

Once we had validated our quantification results as described 
above, we next used the quantitative values of the protein 
abundances to investigate the correlation of their association 
with the B and C complexes. To do this, we consider proteins 
that show a B:C value above 2.0 for at least two of the 
three methods to be specifically associated with the B com- 
plex and those showing a value below 0.5 for at least two of 
the three methods to be specifically associated with the C 
complex. We also take the previous ordering and grouping 
of proteins into account (Bessonov et al. 2008, 2010; Agafo- 



Proteins predominantly associated with 
spliceosomal B complexes 

During the transition from the B to the C 
complex, the Ul and U4 snRNPs with 
their associated proteins are destabilized 
and dissociate; this is clearly reflected 
by the high B:C ratios observed for these 
proteins (Table 1; Fig. 4A,B). The U4/ 
U6 snRNP-specific proteins showed very 
high B:C ratios (average B:C 10.54), as de- 
termined by all three approaches (Fig. 
4B), showing that they, together with 
the U4 snRNA, also dissociate from the 
spliceosome during the transition from 
the B to the C complex. All three methods 
revealed high B:C ratios (>3) for the U6 
snRNP LSm proteins, showing that, al- 
though the U6 snRNA remains associated 
with the C complex, the LSm proteins dis- 
sociate from U6 during the B-to-C transi- 
tion (Table 1; Fig. 4C). Contrary to the 
U2A' and U2-B" proteins (see above), 
the U2 snRNP-specific SF3a and SF3b 
splicing factors were found to be more 
abundant in the B complex (Fig. 3B) 
and thus, however, do not seem to belong in the category of 
U snRNP "core" proteins of the B and C spliceosomes. 
Likewise, U5 snRNP-specific proteins 15K, 52K, lOOK, and 
102K show high B:C ratios as obtained by all three methods 
and thus represent B-specific proteins (Fig. 3C). In addition, 
all proteins specific to the tri-snRNP (U4/U6.U5) showed 
high B:C ratios (Fig. 4B; Table 1); this is consistent with pre- 
vious studies that have shown dissociation of some tri- 
snRNP-specific proteins from the spliceosome during its ac- 
tivation (Makarov et al. 2002). 

For several non-snRNP proteins high B:C ratios were ob- 
served, showing that they are more abundant in B-complex 
preparations. Some examples are RED (average B:C 7.39), h- 
Smu-1 (average B:C 6.12), and UBL5 (B:C 13.56 [iTRAQ]). 
By using our quantification methods, we classify these pro- 
teins as specific for the B complex (hPrp38, hSnu23, 
TFIPll, MFAPl, RED, hSmu-1, RBM42, TRAP3, UBL5, 
HsKinl7, Npw38, Npw38BP) (Table 1; Fig. 5A). In addition 



www.rnajournal.org 41 3 



Schmidt et al. 




SmB SmDI SmD2 SmD3 SmE SmF SmG 
■ SILAC iTRAQ ■ spectral count 



B 



U2 snRNP 



10.0 



5 2.0 



1.0 
0.8 



iJ. 




< to 

3 B 



I SILAC iTRAQ ■ spectral count 



0 10.0 

'6 

e 

1 4X) 



U5 snRNP 



1.0_ X j .L-i TT i 




220K 200K 116K 40K 102K 15K lOOK 52K 
■ SILAC ITRAQ ■ spectral count 



FIGURE 3. Relative protein abundances of the Sm proteins and the U2 
and U5 snRNP-specific proteins obtained by spectral count, SILAC, and 
iTRAQ. The B:C ratios are shown for the Sm proteins (A), the U2 
snRNP-specific proteins (B), and the U5 snRNP specific proteins (C). 
Ratios of proteins in B versus C complex are plotted on a logarithmic 
scale; error bars, SD between the two biological replicates. " 1 " indicates 
that a protein is present in these complexes in a 1 : 1 ratio. The different 
shading of the bars represents the ratios of proteins in B and C complex- 
es derived from the values obtained by SILAC, iTRAQ, and spectral 
count, respectively (see Table 1; Supplemental Tables). 



to these proteins, we thus extend the list of B-specific proteins 
by six proteins (FUSE3, PUF60, RBM5/LUCA15, SAFB-like, 
SFRS12, and SPF30) (Table 1), which were not present in 
the protein lists of previous studies (Bessonov et al. 2008; 
Agafonovet al. 2011). 



Proteins predominantly associated with C complexes 

The so-called step 2 splicing factors are required for the sec- 
ond step of pre-mRNA splicing, which occurs in the C com- 
plex (for review, see Umen and Guthrie 1995; Smith et al. 
2008), and accordingly, those proteins should be more abun- 
dant in this complex. Indeed, all of these proteins (hPrpl7, 
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FIGURE 4. Relative protein abundances in the B and C complexes for 
the Ul, U4/U6, and U4/U6.U5 snRNP-specific proteins and the LSm 
proteins, as obtained by SILAC, iTRAQ, and spectral count. (A) B:C ra- 
tios for Ul snRNP-specific proteins. No protein ratios were obtained for 
Ul-C from the spectral count since this protein was completely absent 
from the C complex. (B) B:C ratios for the U4/U6- and U4/U6.U5-spe- 
cific proteins. (C) B:C ratios for the LSm proteins. No protein ratio was 
obtained from the spectral count for LSm4 since this protein was absent 
from the C complex. LSm5 was not identified or quantified. For details, 
see legend to Figure 3. 
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FIGURE 5. Relative protein abundances of the B- and C- specific proteins. {A) B:C ratios for B- 
specific proteins. (B) B:C ratios for C-specific proteins. For several proteins, no values for spectral 
count were obtained as these proteins had spectra only in the B (hSnu23, UBL5, Npw38, SPF30) 
or the C (PPIL3b, FRGl, NOSIP, ClorfSS, FAM50B, C9orfy8, DGCR14, CCDC130, NKAP, 
ZCCHCIO, CDKIO, TTC14, DDX34, PRKRIPl) complex. For details, see legend to Figure 3. 



et al. 2000). Together with U5 snRNP, it 
forms the remodeled 35S U5 complex in 
humans (Makarovet al. 2002), and recent 
studies in yeast found it in the 35S ILS 
intron-lariat spliceosome (Fourmann 
et al. 2013). Here, we found that aU of 
these proteins (hPrpl9, CDC5L, SPF27, 
PRLl, Hsp70, AD-002)— with the excep- 
tion of CTNNBLl, which showed an av- 
erage B:C ratio of 1.34 (Table 1) — ^were 
clearly more abundant in the C complex 
than the B complex (Table 1; Fig. 6B). 
The Npw38 and Npw38BP proteins 
were previously found to comigrate at 
the top of the gradient during hPrpl9/ 
CDC5L complex purification, suggesting 
that they were co-isolated and/or not sta- 
bly associated with the hPrpl9/CDC5L 
complex (Makarova et al. 2004). These 
two proteins were found here to be highly 
abundant in the B complex compared 
with the C complex (Table 1; Fig. 5A), 
demonstrating their predominant associ- 
ation with the B but not with C complex. 

The RES complex 



hPrplS, hPrp22, hPrpl6, and hSlu7) had B:C ratios between 
0.1 and 0.5 in iTRAQ and SILAC and close to 0 by spectral 
count, showing that they were clearly associated with the cat- 
alyticaUy active spliceosomal C complex (Fig. 6A; Table 1). 

Additional proteins have previously been classified as 
"proteins recruited to the C complex" and "potential C com- 
plex-specific proteins" (Bessonov et al. 2008). AU of these 
proteins revealed low B:C ratios by all three quantification 
methods (Table 1; Fig. 5B; Supplemental Tables 2, 4, 6). 

On the basis of these quantification results, we extended 
the list of C-specific proteins by three proteins (DDX34, 
NUFIPl, and PRKRIPl) (Table 1), all of which showed 
high abundance in the C complex, with B:C ratios between 
0.04 and 0.37. The DDX34 protein, which was not previously 
identified within any spliceosomal complex, was clearly iden- 
tified as being more strongly represented in the C complex by 
both iTRAQ and SILAC. Spectral-count analysis identified 
this protein solely in the C complex but not in the B complex. 

The hPrpl9/CDC5L complex 

The hPrpl9/CDC5L (NTC in yeast, Chan et al. 2003) complex 
is essential for pre-mRNA splicing and associates with the 
spliceosome before the first catalytic step of splicing (Ajuh 



The RES (retention and splicing) com- 
plex consists of SNIPl, MGC12135, and 
CGI- 79. It binds to the spliceosome be- 
fore the first step of splicing and is re- 
quired for efficient intron removal and 
nuclear pre-mRNA retention (Dziembowski et al. 2004). 
We found by all three quantification methods that the 
hRES proteins associate with both the B and C complexes, 
showing a 1:1 B:C ratio (Table 1). Indeed, this is the only 
non-snRNP protein complex among the quantified non- 
snRNP splicing factors that remains constantly associated 
in the B and C complex. 

The exon junction complex 

The exon junction complex (EJC) protein complex binds 
spliced mRNAs in a sequence-independent manner close to 
site of exon-exon ligation. It is a highly dynamic complex 
with stably and more weakly associated protein components 
(Le Hir et al. 2000; Lau et al. 2003; Merz et al. 2007; Singh 
et al. 2012). We found that the EJC did not show the same 
B:C ratios for all its proteins. Only for three proteins 
(eIF4A3, Magoh, and Y14), we observed low B:C ratios 
(~0.25) (Table 1; Fig. 6A), showing that they are much more 
abundant in the C complex, while proteins UAP56 and 
Pinin are much more abundant in the B complex (B:C ratios 
of ~2.5) (Table 1). Other proteins previously assigned to the 
EJC showed fluctuating protein ratios (Supplemental Table 
1), which did not allow us to classify them unambiguously. 
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FIGURE 6. Relative protein abundances of the step 2 factors, the EJC 
and the hPrpl9/CDC5L complex-specific proteins obtained by 
SILAC, iTRAQ, and spectral count. (A) B:C ratios for step 2 factors 
and the EJC. No protein ratio was obtained from the spectral count 
for hPrplS since this protein was absent from the B complex. (B) B:C 
ratios for the hPrpl9/CDC5L complex-specific proteins. For 
CTNNBLl, no clear association to either the B or the C complex could 
be determined. For details, see legend to Figure 3. 

DISCUSSION 

In this study, we utilize three independent MS quantification 
techniques to compare the proteomes of stages of the spHceo- 
somal complex during its transition from a precatalytic to a 
catalytically active state. The various functional states of the 
spliceosome provide an ideal system to investigate quantita- 
tive MS approaches on native and functionally active molec- 
ular assemblies. First, the amount of complexes to be 
compared can be precisely controlled by measuring the 
amount of radioactively labeled pre-mRNA in each complex; 
second, the functionality of the complexes can be monitored 
by corresponding assays. 

We found good overall agreement between results from the 
label-free spectral count and those from the two methods that 
use stable isotope labeling (SILAC and iTRAQ). The fact that 
our results are consistent among the different quantification 
approaches, in spite of the fact that we used different analytical 



conditions during the quantitative MS analysis (i.e., different 
MS instruments and liquid chromatography conditions; see 
Materials and Methods) underlines the reproducibility of 
the methods employed here. Moreover, as previously generat- 
ed data sets (Bessonovet al. 2008) were used to explore spectral 
counting as a label-free MS approach — and, importantly, gave 
results consistent with those of both the labeling approaches — 
this demonstrates that the evaluation of the protein abundance 
among various spliceosomal complexes as performed for A, B, 
B*"^', and C in human spliceosomes and B, B'"^*, B*, and C yeast 
spliceosomes is valid and indeed reflects the association of a 
distinct protein with its corresponding complexes (Deckert 
et al. 2006; Behzadnia et al. 2007; Bessonov et al. 2008, 2010; 
Fabrizio et al. 2009; Agafonov et al. 201 1). 

In agreement with previous comparative studies on MS- 
based quantification (Hendrickson et al. 2006; Collier et al. 
2010; Li et al. 2012), discrepancies between the isotope label- 
ing and the spectral count techniques were seen, especially for 
small proteins (<20 kDa, e.g., Sm proteins) (Table 1), for 
which accurate quantification was not achieved by spectral 
count owing to the limited number of peptides that were 
generated. The spectral count was also a little less accurate 
than labeling in quantifying proteins present in equal 
amounts within the B and C complexes, such as U5-220K 
(B:C 1.68), U5-40K (B:C 1.50), and CBP20 (0.79) (see Sup- 
plemental Tables 2, 4, 6). Although the two methods based 
on stable isotope labeling yielded similar results, iTRAQ 
showed overall lower values for proteins strongly represented 
in the B complex and showed slightly higher values for pro- 
teins that are strongly represented in the C complex, than 
SILAC did. This may be due to the fact that precursor selec- 
tion for MS/MS is not 100% selective, allowing reporter 
ions of co-eluting peptides to contribute to the iTRAQ report- 
er ion intensity of the peptide being analyzed (Bantscheff 
et al 2007). 

We compared our results with those from the previously 
published semi-quantitative analyses by Agafonov et al. 
(2011) and Bessonov etal. (2008) and found that all the stud- 
ies show the same results to a large extent. Remarkably, we 
found that most of the U snRNP-specific proteins are abun- 
dant in the B complex but not in the C complex. Here, we 
show that not only are Ul and U4 snRNP-specific proteins de- 
stabilized during the transition from complex B to complex C, 
but also U5 and U2 snRNP-specific proteins and LSm pro- 
teins. This leaves a U snRNP-specific protein "core" in the 
B and the C complex consisting of only the Sm proteins, 
four U5 snRNP-specific proteins (220K, 200K, 11 6K, and 
40K), and two U2 snRNP- specific proteins (U2A' and U2B"). 

Evaluation of our quantitative data also allowed us to 
classify non-snRNP proteins as being specific for the B com- 
plex (Table 1). We extended the list of B-specific proteins by 
six proteins (FUSE3, PUF60, RBM5/LUCA15, SAFB-like, 
SFRS12, and SPF30) (Table 1), which showed high B:C ratios 
by at least two of the three methods. However, we have to take 
into account the fact that these proteins might be residual 
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proteins belonging to the A complex, which was not included 
in our study. They might associate with the spliceosome at a 
very early point during pre-mRNA splicing and dissociate 
during formation of the catalytically active C complex. One 
example of this is the protein THRAP3, which is classified 
as B-specific by our results but was found to be specifically 
associated with the A complex by Agafonov et al. (2011). 
Along the same lines, we cannot make a clear statement re- 
garding the hnRNP proteins. Although these are more abun- 
dant in the B complex, we have no clear evidence as to 
whether they are B-specific or whether they are residual pro- 
teins from the H/A complex. 

More precise conclusions can be drawn for C complex pro- 
teins. The hPrpl9/CDC5L complex (Ajuh et al. 2000; 
Makarova et al. 2004) was found to be associated with the B 
complex but was shown to be more abundant in the C 
complex; we show here that the quantitative values are ap- 
proximately twice as great in the C as in the B complex, sug- 
gesting that this complex interacts loosely with the B complex 
and then becomes more stably associated with the C complex. 
Similarly, Agafonov et al. (2011) found the hPrpl9/CDC5L 
complex to be less abundant in the B complex but highly 
abundant in the B"'^* and C complexes. We did not analyze 
the intermediate complex B"^' in our study, and the slightly 
higher B:C ratio of the hPrpl9/CDC5L proteins might reflect 
the association of these proteins during the B-to-C transition. 
In contrast, the second-step factors (hPrpl6, hPrpl7, hPrplS, 
hPrp 22, and hSLU7) clearly show high abundance in the C 
complex with very low B:C ratios (~0.25), showing their asso- 
ciation with the activated C complex only. Only three EJC 
proteins (eIF4A3, Magoh, and Y14) were also found to be 
clearly associated with the C complex. Accordingly, we define 
other proteins showing very low B:C ratios as "C-specific pro- 
teins" (Table 1). Some of these proteins have already been 
found to be abundant in the C complex by Agafonov et al. 
(2011); however, we are now able to extend the number of 
C-complex proteins (e.g., cl9orf29, FAM32A, clOorf4, 
DDX34, NUFIPl) (Table 1). Importantly, DDX34 has never 
been described in the context of spliceosomal complexes. 
This protein is a probable ATP-dependent RNA helicase (gi| 
38158022), but its function has not yet been described, either 
in yeast or in humans. DDX34 thus represents a potential tar- 
get for future studies. 

Alongside the proteins that were clearly classifiable into B 
and C complex proteins, we identified some proteins that 
could not clearly be assigned owing to their inconsistent quan- 
tification values (Supplemental Table 1). For these proteins, 
different B:C ratios were obtained by the different quan- 
tification approaches (e.g., WDR70, SKIV2L2, DDX3) 
(Supplemental Table 1). There are several possible reasons 
why these proteins do not show a clear quantitative associa- 
tion. First, they may represent transiently bound proteins 
that easily dissociate from the complexes during purification 
and are thus present to different extents in the different com- 
plex preparations. Examples are the pre-mRNA/mRNA-bind- 



ing proteins (Supplemental Table 1), which showed very 
inconsistent quantitative trends and at least two of which 
(YB-1 and ASR2B) were found to be present in all spliceoso- 
mal complexes (A, B, B'"^^ and C) detected in each case at a dif- 
ferent abundance (Agafonov et al. 201 1). Second, they maybe 
components of the B"'^' complex, the intermediate complex in 
the B-to-C transition. We did not quantify the proteome of 
this complex, and the B:C ratios of proteins abundant in the 
B"^' complex fluctuate between different preparations and 
quantification approaches. Indeed, some of these proteins 
were found by Agafonov et al. (2011) to be specific for the 
B"^' complex, and a recent study in which the human B"*^' com- 
plex was analyzed confirms this assumption, as several of these 
proteins were shown to be abundant in the B'"^' complex only 
(Bessonovet al. 2010); examples are hPrp2, PPIL2, RNFl 13A, 
MGC20398, andMGC23918 (Supplemental Table 1). 

CONCLUSIONS 

We applied three independent MS-based quantification tech- 
niques to compare the proteomes of the precatalytic and the 
catalytically active spliceosomes (i.e., B and C complexes). 
Overall, we confirm, but also extend, results from previous 
studies that addressed the relative abundances of proteins 
in the respective complexes by semi-quantitative approaches. 
We have found that the label-free spectral count technique 
provided a suitable method for quantifying highly purified 
samples (such as the spliceosome or other RNPs). However, 
it has its limits when quantifying proteins of low molecular 
weight or small fold changes. This is the first report of 
SILAC used to label and subsequently purify a molecular ma- 
chine that was functionally as active as its nonlabeled coun- 
terpart (as shown here for the spliceosome in terms of 
splicing efficiency and kinetics). Thus, it may be applicable 
to monitoring the assembly kinetics of the spliceosome in 
short time-frames in order to address the dynamic incorpo- 
ration and release of proteins in its various functional states 
or to gain insight into the changes of protein modification 
during the splicing cycle. Overall, iTRAQ (or a similar ap- 
proach using isotope-labeled reagents) is easier to apply to 
the quantitative investigation of spliceosomal proteins, in 
particular as a relatively large amount of NE is required for 
the assembly of spliceosomes. Thus, a reliable quantification 
technique based on chemical labeling with stable isotopes 
would be of benefit for in-depth quantitative analyses of 
spliceosomes from sources with low quantities. 

MATERIALS AND METHODS 

Preparation of metabolically labeled "light" and 
"heavy" NEs 

Dulbecco's modified Eagle medium (DMEM; PAA Laboratories) 
lacking L-arginine and L-lysine was supplemented with 10% (v/v) 
dialyzed fetal bovine serum (PAA Laboratories), 1 x penicillin/ 
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streptomycin, and 50 mg/L of normal ("light") or stable-isotope-la- 
beled ("heavy") L-arginine and L-lysine. HeLa S3 cells were grown 
in custom-made DMEM containing either "light" or "heavy" L-ar- 
ginine and L-lysine. Cells were grown for at least six passages at 100 
mL (~0.5 X 10** to 1.0 x lO'^ cells/mL) in 200 mL spinner flasks. The 
cells were then expanded to higher volumes and transferred to a 2.5 
L fermenter. The cells were grown under standard conditions (50 
mg/L L-arginine and L-lysine) with continuous perfusion of medi- 
um (0.5-1.0 v/24 h). A total of 1.5 L (5 x lO*" cells/mL) of cells was 
harvested, and NEs were prepared according to the method of 
Dignam et al. (1983) from "light"- or "heavy"-labeled cells. L-argi- 
nine and L-lysine, and stable isotope-labeled L-arginine ('^C6^N4; 
Arg+10) and L-lysine ('^Q; Lys+6), were purchased from Sigma- 
Aldrich. 

Generation of aptamer-tagged pre-mRNA 
and in vitro splicing 

MS2-tagged PM5 pre-mRNA was generated as previously described 
(Deckert et al. 2006; Bessonov et al. 2008). In vitro splicing was per- 
formed in either "light" or "heavy" HeLa NE, using '^P-labeled, 
m^G(5')ppp(5')G-capped, and MS2-tagged PM5 pre-mRNA. 
RNA was separated on a 8.0 M urea-10% (v/v) polyacrylamide 
gel. Spliceosomal complex assembly was analyzed by native gel elec- 
trophoresis on a 2% (m/v) agarose gel. 

Affinity selection of spliceosomal B and C complexes 

Spliceosomal B and C complexes for proteomic analysis and SILAC 
and iTRAQ quantification were isolated as previously described 
(Bessonov et al. 2008). Briefly, ^^P-labeled, m^G(5')ppp(5')G- 
capped, and MS2-tagged PM5 pre-mRNA was incubated with 
MS2-MBP fusion protein. Spliceosomal complexes were allowed 
to assemble from "light" or "heavy" NEs for 6 min (B complex) 
or 180 min (C complex). For SILAC quantification, B complexes 
were assembled from "heavy" NEs, and C complexes were assem- 
bled from "light" NEs. Assembled complexes were separated on 
10%-30% (v/v) glycerol gradients, and 40-45S gradient fractions 
were subjected to affinity selection on amylose beads. 

Sample preparation and MS for SILAC quantification 

Three pmoles of affinity-purified B and C complexes was mixed in 
equal amounts according to the ^^P-labeled pre-mRNA. Proteins 
were separated by gel electrophoresis on a 4%-12% Bis-Tris precast 
gel (NuPAGE, Invitrogen) and stained with colloidal Coomassie 
blue. The entire gel lane was cut into 25 pieces, and proteins were 
digested in-gel as described previously (Shevchenko et al. 1996). 
Samples were redissolved in 10% (v/v) acetonitrile/0.15% (v/v) for- 
mic acid (FA) and analyzed on a CAP-LC system coupled to a Q- 
ToF Ultima mass spectrometer (Waters) or on an Agilent HP 
1100 series system coupled to a LTQ-Orbitrap XL (Thermo Fisher 
Scientific). 

Sample preparation and MS for iTRAQ or spectral 
count quantification 

Four pmoles each of affinity-purified B and C complexes were sep- 
arated by 8%/14% (v/v) SDS-PAGE, respectively, and stained with 



Coomassie blue. Entire gel lanes were cut into 60-70 slices. 
Proteins were digested in-gel as described previously (Shevchenko 
et al. 1996) except that 50 mM triethylammonium bicarbonate buff- 
er (TEAB, Sigma-Aldrich) instead of ammonium bicarbonate buffer 
was used for the iTRAQ preparation. For iTRAQ labeling, the ex- 
tracted peptides were dissolved in 20 |iL 100 mM TEAB buffer. 
Internal standards were prepared by mixing 5 [iL TEAB buffer 
with 5 (iL aliquots of samples generated from gel slices cut at the 
same height from both gel lanes. iTRAQ reagents were reconstituted 
at room temperature in 70 \iL ethanol per vial. iTRAQ reagent (5 |iL) 
was added to each sample, and samples were incubated at room 
temperature for 1 h with gentle mixing. Internal standards were la- 
beled with iTRAQ-114, and samples from the B and C complexes 
were labeled with iTRAQ- 115 and iTRAQ- 116, respectively. The re- 
maining iTRAQ reagent was quenched by adding 5 [iL of 50 mM 
glycine and incubating at room temperature for 30 min with gentle 
mixing. Samples to be compared, such as those containing the pep- 
tides generated from B and C complexes, were pooled with their rel- 
evant internal standards and then dried in a vacuum centrifuge 
(Schmidt and Urlaub 2009). For the MS analysis, iTRAQ or spectral 
count samples were dissolved in 10% (v/v) acetonitrUe with 0.15% 
(v/v) FA and subsequently analyzed on a Waters Q-TQF Ultima 
coupled to a Waters CAP-LC system. 

LC-coupled ESI MS-MS/MS on a Q-ToF mass 
spectrometer 

To analyze samples over the CAP-LC system coupled to the Q-ToF 
Ultima mass spectrometer (Waters), peptides were separated online 
by reversed-phase chromatography using 0.1% (v/v) FA as mobile 
phase A and 80% (v/v) acetonitrile/0.15% (v/v) FA as mobile phase 
B. The peptides were loaded onto a trap column (|j-Precolumn 
Cartridge, Acclaim PepMaplOO C18, 300 |im Ld. x 5 mm, LC 
Packings) and separated at a flow rate of 200 nL/min on an analytical 
column packed in-house (Cig, ReprosU, Maisch) with a gradient of 
7%-40% mobile phase B over 50 min. Eluted peptides were ana- 
lyzed directiy in the Q-ToF mass spectrometer in a data-dependent 
manner. MS scans were acquired for 1 sec followed by three MS/MS 
spectra for 3 sec, each with an ion-mass window set to 2.5 Da. The 
MS-to-MS/MS switch was set to 15 counts/sec, and the MS/MS-to- 
MS was set to an intensity below a threshold of 2 counts/sec. Charge 
state recognition was used to estimate the collision energy for the se- 
lected precursors. Scan time and InterScan time were set to 0.9 sec 
and 0.1 sec, respectively. Peak lists were generated from raw data 
by using MassLynx v4.0 software with the following settings: smooth 
window 4.00, number of smooth 2, smooth mode Savitzky-Golay, 
percentage of peak height to calculate centroid spectra at 80% 
with no baseline subtraction. 

LC-coupled ESI MS-MS/MS on a LTQ-Orbitrap 
mass spectrometer 

Samples were analyzed on an HP 1100 series system (AgUent) cou- 
pled to a hybrid Linear Ion Trap-Qrbitrap mass spectrometer 
(LTQ-Orbitrap XL, Thermo Fisher Scientific). Peptides were sepa- 
rated by online reversed-phase nanoflow chromatography, using 
0.1% (v/v) FA as the mobile phase A and 95% (v/v) acetonitrUe/ 
0.1% (v/v) FA as the mobile phase B. Peptides were loaded onto a 
trap column packed in-house (1.5 cm, 360 [im o.d., 150 ^m i.d.. 
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ReproSil-Pur C18-AQ, 5 [im, Dr. Maisch) and separated at a flow 
rate of 300 nL/min on an analytical CI 8 capillary column (30 cm, 
360 \iim o.d., 75 [xm i.d., ReproSU-Pur C18-AQ, 5 |im) with a gradi- 
ent of 0%-38% mobile phase B over 35 min. Eluted peptides were 
analyzed directly in the mass spectrometer (LTQ-Orbitrap XL; 
Thermo Fisher Scientific). The LTQ-Orbitrap was operated in a 
data-dependent mode. Survey fuU-scan MS spectra were acquired 
in the LTQ-Orbitrap (m/z 350-1400) with a resolution of 30 000 
at m/z 400, and an automatic gain control target of 5 x 10^. The 
five most intense ions were selected for CID (collision-induced dis- 
sociation) MS/MS fragmentation and detection in the linear ion 
trap, with previously selected ions dynamically excluded for 60 
sec. Singly charged ions and ions with unrecognized charge states 
were also excluded. Internal calibration of the Orbitrap was per- 
formed using the lock mass option (lock mass: m/z 445.120025) 
(Olsen et al. 2005). Mascot generic format (mgf) files were generat- 
ed from raw data using Mascot Daemon v2.2.2 (Matrix Science). 

Data analysis and quantification 

Peak lists generated were searched against NCBI nonredundant da- 
tabase (October 8, 2007; 5539442 sequences), by using Mascot 
v.2.2.04 as search engine. The mass accuracy filter used was 0.2 
Da for the parent and fragment ions for the Q-ToF mass spectrom- 
eter. For the Orbitrap mass spectrometer, this was 5 ppm for precur- 
sor and 0.5 Da for product ions. Peptides with no or at most two 
missed cleavage sites were defined as tryptic peptides. Carbamido- 
methylation of cysteines and oxidation of methionine residues 
were allowed as variable modifications. For SILAC and iTRAQ 
quantification, "heavy" arginine (Arg+10) and "heavy" lysine (Lys+6) 
and iTRAQ modifications, respectively, were allowed as fixed 
modifications. 

SILAC quantification 

SILAC quantification was carried out by using unique peptides with 
the MSQuant software vl.2. Data normalization was performed on 
proteins known to be present in a 1:1 ratio (e.g., the 5' pre-mRNA 
cap-binding proteins CBP20 and CBP80, the U5-220K and U5- 
200K proteins). 

ITRAQ quantification 

Non-normalized peptide ratios for iTRAQ quantification were ob- 
tained from Mascot v2.2.04 for unique peptides with a minimum 
peptide score of 20. Proteins were quantified from the main bands 
by calculating the mean ratio after manual removal of outliers. 
Data normalization was performed on proteins known to be present 
in a 1 : 1 ratio, as above. Protein ratios obtained were further validated 
by three independent procedures: (1) calculating the labeling effi- 
ciency for each protein in each band; (2) using the same amounts 
of nonmodified trypsin (Roche), resulting in a 1:1 ratio for autopro- 
teolytic trypsin peptides; and (3) analyzing peak intensities of the 
reporter ions for the internal standards (iTRAQ- 114) of low-scor- 
ing peptides. As the internal standard was prepared by pooling 
aliquots from iTRAQ- 115- and iTRAQ- 116-labeled samples the 
following equation represents the intensity ratios: intensity 
[iTRAQ-114] =1/3 intensity[iTRAQ-115] +1/3 intensity[iTRAQ- 
116] whereby [iTRAQ- 1 14] = (5 pL/(20 |aL-5 pL)) [iTRAQ 



-115] + (5 pL/(20 ^L-5 ^L)) [iTRAQ-116] (see also Schmidt 
and Urlaub 2009). 

Spectral count 

Unweighted spectral count for proteins identified in B and C com- 
plexes was obtained by using the software Scaffold 2. B:C protein ra- 
tios were calculated manually from the obtained number of spectra 
for each protein. 

SUPPLEMENTAL MATERIAL 

Supplemental material is available for this article. 
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